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DETAILED ACTION 

1. The Amendment filed on June 8. 2006 has been received and entered. Claims 
1-88 are pending 

2. Applicant's amendment has overcome previous claim rejection under 112 2"^. 

Specification 

3. The disclosure is objected to because of following infonrialities: in the section 
titled Detailed Description of Exemplary Embodiments on page 34 in paragraph 0129 
the recitation "later developed" draws toward the unpredictability of the future. This is 
not clear and concise language. 

Claim Objections 

4. Claim 84 recites the word "instructions for*' in the body of the claims. It indicates 
intended use and as such does not carry patentable weight. The word could be 
changed to recite "instructions to". Applicant is required to amend the claims so that the 
claim limitations are recited in a definite form. 

Double Patenting 

5. The nonstatutory double patenting rejection is based on a judicially created doctrine grounded in public 
policy (a policy reflected in the statute) so as to prevent the unjustified or improper timewise extension of the 
"right to exclude" granted by a patent and to prevent possible harassment by multiple assignees. A 
nonstatutory obviousness-type double patenting rejection is appropriate where the conflicting claims are not 
identical, but at least one examined application claim is not patentably distinct from the reference claim(s) 
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because the examined application claim is either anticipated by, or would have been obvious over, the 
reference claim(s). See, e.g.. In re Berg, 140 F.3d 1428, 46 USPQ2d 1226 (Fed, Cir. 1998); In re Goodman, 11 
F.3d 1046, 29 USPQ2d 2010 (Fed. Cir. 1993); In re Longi, 759 F.2d 887, 225 USPQ 645 (Fed. Cir. 1985); In re 
Van Ornum. 686 F.2d 937, 214 USPQ 761 (CCPA 1982); In re Vogel, 422 F.2d 438, 164 USPQ 619 (CCPA 
1970); and In re Thorington, 418 F.2d 528, 163 USPQ 644 (CCPA 1969). 

A timely filed terminal disclaimer in compliance with 37 CFR 1.321(c) or 1.321(d) may be used to 
overcome an actual or provisional rejection based on a nonstatutory double patenting ground provided the 
conflicting application or patent either is shown to be commonly owned with this application, or claims an 
invention made as a result of activities undertaken within the scope of a joint research agreement. 

Effective January 1, 1994, a registered attorney or agent of record may sign a terminal disclaimer. A 
tenninal disclaimer signed by the assignee must fully comply with 37 CFR 3.73(b). 



6. Claims 1, 20, 39, 58, 81-84 are provisionally rejected on the ground of 
nonstatutory obviousness-type double patenting as being unpatentable over claims 1 , 
16, 31-32 of copending Application No. 10/626.856. Although the conflicting claims are 
not identical, they are not patentably distinct from each other because the claims use 
determining steps that are clearly similar. For example in claim 1 of the instant 
application applicant states "determining source-identified training stories", in claim 1 of 
application 10/626,856 applicant states "determining a source-identified story corpus, 
each story associated with at least one event". In effect both claims state the same 
thing. Other steps in reminder of the claims follow the same reasoning. 

This is a provisional obviousness-type double patenting rejection because the 
conflicting claims have not in fact been patented. 



Claim Rejections - 35 USC § 101 

7. 35 U.S.C. 101 reads as follows: 

Whoever invents or discovers any new and useful process, machine, manufacture, or composition of 
matter, or any new and useful improvement thereof, may obtain a patent therefor, subject to the 
conditions and requirements of this title. 
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8. Claims 1 ,20. 39, 58. 77 and 81-84 are rejected under 35 U.S.C. 101 because the 
claimed invention is directed to non-statutory subject matter. 

Claims 1, 39, 77, 82 and 84 list computational steps in a program (i.e. software 
per se) without tangible, useful, concrete result. The claims do not list any hardware 
(i.e. computer) tied to the method steps in the body of the claim in order to realize the 
"determining" functionality. 

Claims 1. 20, 39, 58, 77 and 81-84 list computational steps in a program without 
tangible, useful, concrete result. The claims do not have any visible result or output. The 
steps of "determining" are missing real world result. Indicating doesn't actually have 
show or output the result of determination. There needs to be an outputting of the link 
or storing for further use. 

Claims 81-84 all state the intended use by use of word "useable". To overcome 
this type of rejection, claims could be amended to recite definite functionality (i.e. 
executed" or "processed" or "to perfomri") 

Claim Rejections - 35 USC § 102 

9. The following is a quotation of the appropriate paragraphs of 35 U.S.C. 102 that 
form the basis for the rejections under this section made in this Office action: 

A person shall be entitled to a patent unless - 
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(e) the invention was described in (1) an application for patent, published under section 122(b), by 
another filed in the United States before the invention by the applicant for patent or (2) a patent 
granted on an application for patent by another filed in the United States before the invention by the 
applicant for patent, except that an international application filed under the treaty defined in section 
351(a) shall have the effects for purposes of this subsection of an application filed in the United States 
only if the international application designated the United States and was published under Article 21(2) 
of such treaty in the English language. 

10. Claim 1-5, 9-10,14-24, 28-29, 33-43, 47-48. 52-62, 66-67, 71-76 and 81-88 are 
rejected under 35 U.S.C. 102(e) as being anticipated by Sundaresan et al. (U.S. Patent 
6,606,620 B1). 

As per claim 1 Sundaresan et al. is directed to a computer-implemented method 
of detennining predictive models for a linked event detection system comprising the 
steps of: 

detennining source-identified training stories (column 3, lines 16-17, wherein 
"stories" means "documents"); 

detemnining inter-story similarity vectors for at least one story-pair (column 4, 
lines 23-24, wherein "similarity vectors" mean "similar terms"); 

determining link label information for the at least one story-pair (column 9, lines 

8-9); 

determining at least one predictive model based on the inter-story similarity 
vector and the link label information (column 10, lines 5-13); and 

indicating a link between other story pairs based on the predictive model and the 
inter-story similarity vector (column 7, line 67; column 8, lines 1-4). 
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As per claim 2 Sundaresan et aL is directed to a step of determining inter-story 
similarity vectors comprises the steps of: 

detemriining at least one inter-story similarity metric for the story-pairs (column 4, 
lines 9-25); 

and determining at least one source-pair statistics for the at least one story-pair 
(column 10, lines 15-17). 

As per claim 3 Sundaresan et al. is directed to a determining inter-story similarity 
vectors further comprise the step of normalizing the inter-story similarity metric based 
on the source-pair statistics (column 10, lines 17-22). 

As per claim 4 Sundaresan et aL is directed to a determining inter-story similarity 
vectors further comprise the step of incrementally normalizing the inter-story similarity 
metric based on the source-pair statistics (column 10, lines 16-22). 

As per claim 5 Sundaresan et al. is directed to the inter-story similarity metric is 
normalized based on at least one of subtraction and division (column 8, lines 22-27). 

As per claim 9 Sundaresan et al. is directed to a comprising the step of 
transforming the source-identified training stories (column 1, line 63, wherein the 
"training stories" are in English). 
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As per claim 10 Sundaresan et al. is directed to transforming tlie source- 
identified training stories is at least one of translating, transcribing and linguistically 
transforming (column 1 , line 63; column 2, line 43, wherein the HTML and XML are in 
English, therefore translation will not be necessary). 

As per claim 14 Sundaresan et al. is directed to at least one inter-story similarity 
metric is normalized based on at least one of a source-pair identified similarity statistic 
(column 10, lines 15-17). 

As per claim 15 Sundaresan et al. Is directed to at least one predictive model is 
at least one of: a classifier, a support vector machine, a decision tree and a Naive- 
Bayes classifier (column 3, lines 13-14). 

As per claim 16 Sundaresan et al. is directed to at least one of the source-pair 
similarity statistics are detemnined based on a source hierarchy (column 3, lines 50-51). 

As per claim 17 Sundaresan et al. is directed to the source hierarchy is 
detemnined based on at least one source characteristic (column 3, lines 61-65, wherein 
"characteristic" means "lean- 



As per claim 18 Sundaresan et al. is directed to the source characteristic is at 
least one of a language characteristic, an input mode characteristic, a genre 
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characteristic, a source name characteristic and a transformation characteristic (column 
3, lines 54-60, wherein language characteristic" means how the words in a document 
are related to each other). 

As per claim 19 Sundaresan et aL is directed to the source-pair similarity statistic 
for a new source is determined based on at least one source characteristic of the new 
source (column 3, lines 50-53, wherein each new source has different hierarchy). 

As per claim 20 Sundaresan et al. is directed to a linked event detection training 
system comprising: 

an input/output circuit (column 7, lines 34-35, wherein it is inherent for computer 
to have input/output device circuit); 

a memory (column 7, lines 34-35, wherein it is inherent for computer to have 
memory); 

a processor that receives source-identified training stories and associated link 
label information for at least one story-pair via the input/output circuit (column 7. lines 
34-35, wherein it is inherent for computer to have a processor); 

an inter-story similarity vector determining circuit that determines an inter-story 
similarity vector for at least one story-pair (column 4, lines 23-24, wherein "similarity 
vectors" mean "similar terms"); 

and a predictive model determining circuit that determines at least one predictive 
model based on the inter-story similarity vector and the link label information (column 
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10, lines 5-13) and which indicates a link between other story pairs based on the 
predictive model and the inter-story similarity vector (column 7, line 67; column 8, lines 
1-4). 

As per claim 21 Sundaresan et al. is directed to the inter-story similarity vector 
determining circuit is comprised of: 

a similarity metric determining circuit that determines at least one inter-story 
similarity metric for the at least one story-pair (column 4, lines 9-25); 

and a similarity statistics detemiining circuit that determines at least one source- 
pair statistic for the at least one story-pair (column 10, lines 15-17). 

As per claim 22 Sundaresan et al. is directed to the inter-story similarity vector 
detemiining circuit nomrializes the inter-story similarity metric based on the source-pair 
statistics (column 10, lines 17-22). 

As per claim 23 Sundaresan et al. is directed to the inter-story similarity vector 
detemnining circuit incrementally normalizes the inter-story similarity metric based on 
the source-pair statistics (column 10, lines 16-22). 

As per claim 24 Sundaresan et al. is directed to at least one of the inter-story 
similarity metrics is nomiallzed based on at least one of a subtraction and a division 
operation (column 8, lines 22-27). 
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As per claim 28 Sundaresan et al. is directed to a comprising the step of 
transforming the source-identified training stories (column 1. line 63, wherein the 
"training stories" are in English). 

As per claim 29 Sundaresan et al. is directed to transforming the source- 
identified training stories is at least one of translating, transcribing and linguistically 
transfomriing (column 1, line 63; column 2, line 43, wherein the HTML and XML are in 
English therefore translation will not be necessary). 

As per claim 33 Sundaresan et al. is directed to the at least one inter-story 
similarity metric is normalized based on at least one of a source-pair identified similarity 
statistic (column 10, lines 15-17). 

As per claim 34 Sundaresan et al. is directed to the at least one predictive model 
is at least one of: a classifier, a support vector machine, a decision tree and a Naive- 
Bayes classifier (column 3, lines 13-14). 

As per claim 35 Sundaresan et al. is directed to the source-pair identified 
similarity statistic is determined based on a source hierarchy (column 3, lines 50-51). 
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As per claim 36 Sundaresan et al. is directed to the source hierarchy is 
determined based on at least one of a source characteristic (column 3, lines 61-65, 
wherein "characteristic" means "leaf). 

As per claim 37 Sundaresan et al. is directed to the source characteristic is at 
least one of a language characteristic, ^ input mode characteristic, a genre 
characteristic, a source name characteristic and a transfomriation characteristic (column 
3, lines 54-60, wherein "language characteristic" means how the words in a document 
are related to each other). 

As per claim 38 Sundaresan et al. is directed to the source-pair similarity statistic 
for a new source is determined based on at least one source characteristics of the new 
source (column 3, lines 50-53, wherein each new source has different hierarchy). 

As per claim 39 Sundaresan et al. is directed to a computer-implemented method 
of linked event detection comprising the steps of: 

detemiining source-identified training stories (column 3, lines 16-17, wherein 
"stories" means "documents"); 

determining inter-story similarity vectors for the story-pairs (column 4, lines 23- 
24, wherein "similarity vectors" mean "similar terms"); 

determining at least one predictive model for link detection (column 10, lines 5- 

13); 
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and determining a link between the story-pairs based on the predictive model 
and the inter-story similarity vector (column 10, lines 5-13, wherein sorting detenmines 
the link); and 

indicating the link (column 7, line 67; column 8, lines 1-4). 

As per claim 40 Sundaresan et al is directed to a step of detennining inter-story 
similarity vectors comprises the steps of: 

determining at least one inter-story similarity metric for each story-pair (column 4, 
lines 9-25); 

and determining source-pair statistics for the story-pairs (column 10, lines 15- 

17). 

As per claim 41 Sundaresan et al. is directed to a determining inter-story 
similarity vectors further comprise the step of normalizing the inter-story similarity metric 
based on the source-pair statistics (column 10, lines 17-22). 

As per claim 42 Sundaresan et al. is directed to a determining inter-story 
similarity vectors further comprise the step of incrementally normalizing the inter-story 
similarity metric based on the source-pair statistics (column 10, lines 16-22). 



As per claim 43 Sundaresan et al. is directed to the inter-story similarity metric is 
normalized based on at least one of subtraction and division (column 8, lines 22-27). 
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As per claim 47 Sundaresan et al. is directed to a comprising the step of 
transforming the source-identified training stories (column 1, line 63, wherein the 
"training stories" are in English). 

As per claim 48 Sundaresan et al. is directed to transforming the source- 
identified training stories is at least one of translating, transcribing and linguistically 
transfomriing (column 1, line 63; column 2, line 43, wherein the HTML and XML are in 
English therefore translation will not be necessary). 

As per claim 52 Sundaresan et al. is directed to the at least one inter-story 
similarity metric is normalized based on at least one of a source-pair identified similarity 
statistic, (column 10, lines 15-17). 

As per claim 53 Sundaresan et al. is directed to the at least one predictive model 
is at least one of: a classifier, a support vector machine and a decision tree, a Naive- 
Bayes-classifier (column 8, lines 22-27). 

As per claim 54 Sundaresan et al. is directed to the source-pair similarity statistic 
for a new source is determined based on at least one source characteristics of the new 
source (column 3, lines 50-51). 
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As per claim 55 Sundaresan et al. is directed to the source hierarchy is 
determined based on at least one of a source characteristic (column 3, lines 61-65, 
wherein "characteristic" means "(ear). 

As per claim 56 Sundaresan et al. is directed to the source characteristic is at 
least one of a language characteristic, an input mode characteristic, a genre 
characteristic, a source name characteristic and a transfomiation characteristic (column 
3, lines 54-60. wherein "language characteristic" means how the words in a document 
are related to each other). 

As per claim 57 Sundaresan et al. is directed to the source-pair similarity statistic 
for a new source is determined based on at least one source characteristics of the new 
source (column 3, lines 50-53, wherein each new source has different hierarchy). 

As per claim 58 Sundaresan et al. is directed to linked event detection system 
comprising: 

an input/output circuit (column 7, lines 34-35, wherein it is inherent for computer 
to have input/output device circuit); 

a memory (column 7, lines 34-35, wherein it is inherent for computer to have 
memory); 

a processor that receives source-identified training stories via the input/output 
circuit (column 7, lines 34-35, wherein it is inherent for computer to have processor); 
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an inter-story similarity vector detennining circuit that detemriines inter-story 
similarity vectors for the story-pairs (column 4, lines 23-24, wherein "similarity vectors" 
mean "similar terms"); 

and a link detemnining circuit that determines and indicates links between story- 
pairs based on a predictive model and the inter-story similarity vectors (column 10, lines 
5-13, wherein sorting determines the link; column 7, line 67; column 8. lines 1-4). 

As per claim 59 Sundaresan et al. is directed to the inter-story similarity vector 
determining circuit is comprised of: 

a similarity metric determining circuit that determines at least one inter-story 
similarity metric for the story-pairs (column 4, lines 9-25); 

and a similarity statistics determining circuit that determines source-pair statistics 
for the story-pairs (column 10, lines 15-17). 

As per claim 60 Sundaresan et al. is directed to the inter-story similarity vector 
determining circuit nomializes the inter-story similarity metric based on the source-pair 
statistics (column 10, lines 17-22). 

As per claim 61 Sundaresan et al. is directed to the inter-story similarity vector 
determining circuit incrementally normalizes the inter-story similarity metric based on 
the source-pair statistics (column 10, lines 16-22). 
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As per claim 62 Sundaresan et al, is directed to at least one of the inter-story 
similarity metrics is normalized based on at least one of a subtraction and a division 
operation (column 8, lines 22-27). 

As per claim 66 Sundaresan et al. is directed to a comprising the step of 
transfomning the source-identified training stories (column 1 , line 63, wherein the 
"training stories" are in English). 

As per claim 67 Sundaresan et al. is directed to transforming the source- 
identified training stories is at least one of translating, transcribing and linguistically 
transforming (column 1, line 63; column 2, line 43, wherein the HTML and XML are in 
English therefore translation will not be necessary). 

As per claim 71 Sundaresan et al. is directed to the at least one inter-story 
similarity metric is nomnalized based on at least one of a source-pair identified similarity 
statistic (column 10, lines 15-17). 

As per claim 72 Sundaresan et al. is directed to the predictive model is at least 
one of: a classifier, a support vector machine and a decision tree, a Naive-Bayes 
classifier (column 8, lines 22-27). 
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As per claim 73 Sundaresan et al. is directed to the source-pair identified 
similarity statistic is determined based on a source hierarchy (column 3, lines 50-51). 

As per claim 74 Sundaresan et al is directed to the source hierarchy is 
determined based on at least one of a source characteristic (column 3, lines 61-65, 
wherein "characteristic" means "leaf). 

As per claim 75 Sundaresan et al. is directed to the source characteristic is at 
least one of a language characteristic, an input mode characteristic, a genre 
characteristic, a source name characteristic and a transformation characteristic (column 
3, lines 54-60, wherein "language characteristic" means how the words in a document 
are related to each other). 

As per claim 76 Sundaresan et al. is directed to the source-pair similarity statistic 
for a new source is determined based on at least one source characteristics of the new 
source (column 3, lines 50-53, wherein each new source has different hierarchy). 

As per claim 81 Sundaresan et al. is directed to computer readable storage 
medium comprising: computer readable program code embodied on the computer 
readable storage medium, the computer readable program code usable to program a 
computer to determine at least one predictive model for a linked event detection system 
comprising the steps of: 
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determining source-identified training stories (column 3, lines 16-17, wherein 
"stories" means "documents"); 

detemilnlng inter-story similarity vectors for at least one story-pair (column 4, 
lines 23-24, wherein "similarity vectors" mean "similar terms"); 

determining link label information for the at least one story-pair (column 9, lines 

8-9); 

and determining at least one predictive model based on the inter-story similarity 
vector and the link label information (column 10, lines 5-13); and 

indicating a link between other story pairs based on the predictive model and the 
inter-story similarity vector (column 7, line 67; column 8, lines 1-4). 

As per claim 82 Sundaresan et al. is directed to computer readable storage 
medium comprising: computer readable program code embodied on the computer 
readable storage medium, the computer readable program code usable to program a 
computer to determine at least one predictive model for a linked event detection system 
comprising: 

instructions for detemiining source-identified training stories (column 3, lines 16- 
17, wherein "stories" means "documents"); 

instructions for detemiining inter-story similarity vectors for at least one story-pair 
(column 4, lines 23-24, wherein "similarity vectors" mean "similar terms"); 

instructions for detemiining link label information for the at least one story-pair 
(column 9, lines 8-9); 
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and instructions for determining at least one predictive model based on the inter- 
story similarity vector and the link label infomnation (column 10, lines 5-13); and 
instructions for indicating a link between other story pairs based on the predictive 
model and the Inter-story similarity vector (column 7, line 67; column 8, lines 1-4). 

As per claim 83 Sundaresan et al. is directed to computer readable storage 
medium comprising: computer readable program code embodied on the computer 
readable storage medium, the computer readable program code usable to program a 
computer to detect linked events comprising the steps of: 

determining source-identified training stories (column 3, lines 16-17, wherein 
"stories" means "documents"); 

detemnining inter-story similarity vectors for the at least one story-pair (column 4, 

lines 23-24, wherein "similarity vectors" mean "similar terms"); determining at 

least one predictive model for link detection (column 9, lines 8-9); 

detennining a link between story-pairs based on the at least one predictive model 
and the inter-story similarity vector (column 10, lines 5-13); and 

indicating the link (column 7, line 67; column 8, lines 1-4). 

As per claim 84 Sundaresan et al. is directed to computer readable storage 
medium comprising: computer readable program code embodied on the computer 
readable storage medium, the computer readable program code usable to program a 
computer to detect linked events comprising the steps of: 
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instructions for detennining sourceHdentified training stories (column 3, lines 16- 
17. wherein "stories" means "documents"); 

instructions for detennining inter-story similarity vectors for the at least one story- 
pair (column 4, lines 23-24, wherein "similarity vectors" mean "similar terms"); 

instructions for determining at least one predictive model for link detection 

(column 9, lines 8-9); 

instructions for detennining a link between story-pairs based on the predictive 
model and the inter-story similarity vector (column 10, lines 5-13); ); and 
indicating the link (column 7, line 67; column 8, lines 1-4). 

As per claims 85 and 86 Sundaresan et aL is directed to determining at least one 
source-pair statistic for the at least one story-pair is based on at least one of a similarity 
metric and a statistic associated with the metric (column 3, lines 25-29, wherein the 
statistical algorithm uses metric for the computations). 

As per claims 87 and 88 Sundaresan et al. is directed to at least one of the 
predictive models is a trained predictive model (column 10, lines 29-33, wherein the 
"trained predictive model" is determined by use of statistical model). 

1 1 . The following is a quotation of the appropriate paragraphs of 35 U.S.C. 102 that 
form the basis for the rejections under this section made in this Office action: 
A person shall be entitled to a patent unless - 
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(a) the invention was known or used by others in this country, or patented or described in a printed 
publication in this or a foreign country, before the invention thereof by the applicant for a patent. 

12. Claim 77-80 are rejected under 35 U.S.C. 102(a) as being anticipated by Brown. 
Ralf D. "Dynamic Stopwording for Story Link Detection", (hereafter referred as Brown) . 

As per claim 77 Brown is directed to a method of determining a stopword list 
comprising the steps of: 

determining a source-identified training corpus of text information (page 1 , 
column 2, lines 26-29); 

detennining a verified first transformation of the source-identified training corpus 
text from a first source mode to a second source mode (page 1 , column 2, lines 26-29; 
page 1 column 2, lines 33-40, wherein the "transformation" would be the "single-pass 
incremental clustering method"); 

detennining an un-verified second transformation of the source-identified training 
corpus text from a first source mode to a second source mode (page 1 , column 2, lines 
17-18, wherein "un-verified" means any "story from a newswire"); 

determining at least one transformation errors associated with distribution 
differences between the first and second transformations and identified sources (page 
2, column 2, lines 4-6); 

determining at least one source-specific transformation actions for the 
determined transfonnation errors (page 2. column 1, lines 2-6); and 
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identifying and transforming transformation errors in other transformed source- 
identified texts based on the source-specific transfonmation actions (page 2, column 1, 
lines 2-7). 

As per claim 78 Brown is directed to the first source mode is at least one of a text 
source, an optical character recognition source and an automatic speech recognition 
source (page 1 , column 2, lines 22-24). 

As per claim 79 Brown is directed to the second source mode is at least one of a 
text source, an optical character recognition source and an automatic speech 
recognition source (page 1, column 2, lines 22-24; page 2, column 1, lines 6-8). 

As per claim 80 Brown is directed to wherein the source-specific transformation 
is at least one of a removal, a repair and a normalization transformation (page 2, 
column 1 , lines 4-6). 

Claim Rejections - 35 USC § 103 

13. The following is a quotation of 35 U.S.C. 103(a) which forms the basis for all 
obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set 
forth in section 102 of this title, if the differences between the subject matter sought to be patented and 
the prior art are such that the subject matter as a whole would have been obvious at the time the . 
invention was made to a person having ordinary skill in the art to which said subject matter pertains. 
Patentability shall not be negatived by the manner in which the Invention was made. 
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14. Claims 6-8, 25-27, 44-46, and 63-65 are rejected under 35 U.S.C. 103(a) as 
being unpatentable over Sundaresan et al. (US Patent 6,606,620 B1) In view of Gange 
et al. (US 2004/006559 Al). 

As per claims 6, 25, 44 and 63 Sundaresan et aL fails to teach the use of 
probability based metric and a Euclidean based similarity metric. 

Gange et al. teaches the use of Euclidean distance (page 3, paragraph 0045). 

It would have been obvious to one in of ordinary skill in the art at the time the 
invention was made to modify Sundaresan et al. by teachings of Gange et al. to include 
the use of Euclidean distance as it is metrics often used in the database field to 
compute distances between similar terms. 

As per claims 7, 26, 45 and 64 Sundaresan et al. as modified fails to teach the 
use of similarity metric is at least one of a Hellinger, a Tanimoto and a clarity distance 
based metric. 

Gange et al. teaches the use of Tanimoto coefficient (page 3, paragraph 0045). 

It would have been obvious to one in of ordinary skill in the art at the time the 
invention was made to modify Sundaresan et al. by teachings of Gange et al. to include 
the use of Tanimoto coefficient as it is metrics often used in the database field to 
compute distances between similar terms. 
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A per claims 8, 27, 46 and 65 Sundaresan et al. as modified fails to teach the use 
of inter-story similarity metric is a cosine-distance based metric. 

Ganoe et al. teaches the use of Cosine coefficient (page 3, paragraph 0045). 

It would have been obvious to one in of ordinary skill in the art at the time the 
invention was made to modify Sundaresan et al. by teachings of Ganoe et al. to Include 
the use of Cosine coefficient as it is metrics often used in the database field to compute 
distances between similar temis 

15. Claims 11-13, 30-32, 49-51 and 68-70 are rejected under 35 U.S.C. 103(a) as 
being unpatentable over Sundaresan et al. (US Patent 6,606,620 B1) in view of Zhou 
(US 2004/0002849 Al). 

As per claims 1 1 , 30, 49 and 68 Sundaresan et al. fails to teach the inter-story 
^imilarify metrics are based on terms in at least one source-identified term frequency- 
inverse story frequency models. 

Zhou teaches the use of frequency-inverse (page 3, column 2, paragraph 0030, 
lines 9-11). 

It would have been obvious to one in of ordinary skill in the art at the time the 
invention was made to modify Sundaresan et al. by teachings of Zhou to include the use 
of frequency-inverse because it predicts effective example of sentence retrieval as 
stated on page 1 , column 1 , paragraph 0005 of Zhou. 
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As per claims 12, 37, 50 and 69 Sundaresan et aL as modified fails to teach the 
terms in source-identified term frequency-inverse story frequency models are based on 
language. 

Zhou teaches that the retrieved samples are to aid in writing or translation (page 
3, paragraph 0030, lines 2-4, wherein writing or translating has basis in language). 

It would have been obvious to one in of ordinary skill in the art at the time the 
invention was made to modify Sundaresan et al. by teachings of Zhou to include the 
inverse-frequency based on language because term comparison includes terms of a 
language. 

As per claims 13, 32, 51 and 70 Sundaresan et al. as modified fails to teach 
determining terms comprises the steps: determining a reference language; and 
determining reference language and non-reference language terms. 

Zhou teaches the changing of sample terms from one mode to another (page 3. 
paragraph 0032). 

It would have been obvious to on of ordinary skill in the art at the time the 
invention was made to modify Sundaresan et al. by teachings of Zhou to include the 
determination of reference language since the correct translation requires the correct 
reference language. 



Application/Control Number: 10/626.875 Page 26 

Art Unit: 2165 

Response to Arguments 

16. Applicant's arguments filed 06/08/2006 have been fully considered but they are 
not persuasive. 

The objection to specification in paragraph 0129 of the specification remains 
valid. Even though the communication like is definite in regard that they connect two or 
more links together, the method in which they do so may contain non-statutory subject 
matter. 

As to applicant response to provisional double patenting also remains valid. The 
instant application is broader. The preamble is directed to linked events making 
predictions based on events that are part of the co-pending application. As stated in 
prior action there are similarities between in dependent claims that would merit double 
patenting issue. 

In regard of the remarks directed to Sundaresan et al. (US 6,606,620 B1) 
reference not teaching determining source-identified training stories is not deemed 
persuasive. If Applicant refers to source-identified specifically then the claim is broad 
enough to mean either the location or type of training stories. Sundaresan. teaches 
that the training documents are semi structured, meaning not reserved to particular 
format or source. The documents could be html or xml as stated in column 5 lines 50- 
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53 of Sundaresan . If the meaning is location then Sundaresan discloses that the 
document could be found on a World Wide Web identified by a URL as described in 
column 6 lines 48-49. To follow that the determining is never used, there is no result to 
the determination. The source-identified is not tied or used again in the body of the 
independent claim. 

As to the second argument directed to Sundaresan reference, that detemnining 
inter-story similarity vectors for at least one story-pair Is also found not persuasive. The 
claim language nor the arguments do not support or realize distinction between 
Sundaresan and claimed invention. Thus, it is broadly interpreted by the examiner on 
the cited reference. 

In regards of the remarks directed to Brown "Dynamic Stopwording for Story Link 
Detection" not teaching determining a verified first transformation of the source 
identified training corpus text from a first source mode to second source mode is not 
deemed persuasive. The claim language does not make it clear as to what first 
transformation is. It's not clear why the applicant believes Brown doesn't cover the 
claimed limitation. Brown broadly interpreted to search for particular stop words given 
weight and by removing the stopwords transforms the test. 

In regards of the remarks directed to Ganae et al. (US 2004/0006559 A1) 
reference is deemed not persuasive. Ganqe discloses the use of Euclidean metric. As 
to the features of claim 1 and other independent claims 20, 39 and 58 that the Applicant 
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states are not disclosed in the combination of Sundaresan and Ganqe the examiner 
addresses above why Sanderson's rejection is maintained. Therefore the 
questionability of Gange reference is directed only to the Euclidean limitation and it is 
covered. 

In regards of the remarks directed to Zhou (US 2004/0002849 A1) reference is 
deemed not persuasive. The reference points to frequency-inverse usage as stated by 
the limitation. In this regard Zhou covers the limitations of claims 11 , 30, 49 and 68. A 
document could be formed by just one sentence. The Applicant also made statement 
that claim 11 is an independent claim. Claim 1 1 is actually dependant on claims. The 
examiner believes that the combination of the references covers the limitations in 
question. 

Conclusion 

17. THIS ACTION IS MADE FINAL. Applicant is reminded of the extension of time 
policy as set forth in 37 CFR 1.136(a). 

A shortened statutory period for reply to this final action is set to expire THREE 
MONTHS from the mailing date of this action. In the event a first reply is filed within 
TWO MONTHS of the mailing date of this final action and the advisory action is not 
mailed until after the end of the THREE-MONTH shortened statutory period, then the 
shortened statutory period will expire on the date the advisory action is mailed, and any 
extension fee pursuant to 37 CFR 1.136(a) will be calculated from the mailing date of 
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the advisory action. In no event, however, will the statutory period for reply expire later 
than SIX MONTHS from the mailing date of this final action. 

Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to Tomasz Ponikiewski whose telephone number is 
(571)272-1721. The examiner can nomrially be reached on 8:00-4:30. 

If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, Jeffrey A. Gaffin can be reached on (571)272-4146. The fax phone number 
for the organization where this application or proceeding is assigned is 571-273-8300. 

Information regarding the status of an application may be obtained from the 
Patent Application Information Retrieval (PAIR) system. Status information for 
published applications may be obtained from either Private PAIR or Public PAIR. 
Status information for unpublished applications is available through Private PAIR only. 
For more infomiation about the PAIR system, see http://pair-direct.uspto.gov. Should 
you have questions on access to the Private PAIR system, contact the Electronic 
Business Center (EBC) at 866-217-9197 (toll-free). If you would like assistance from a 
USPTO Customer Service Representative or access to the automated information 
system, call 800-786-91 99 (IN USA OR CANADA) or 571-272-1000. 



Tomasz Ponikiewski 
September 2, 2006 
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